Abstract

Lattice reduction algorithms, such as the LLL algorithm, have been proposed as preprocessing tools 
in order to enhance the performance of suboptimal receivers in MIMO communications. 
In this paper we introduce a new kind of lattice reduction-aided decoding technique, called augmented
lattice reduction, which recovers the transmitted vector directly from the change of basis matrix,
and therefore does not entail the computation of the pseudo-inverse of the channel matrix or its QR
decomposition.

We prove that augmented lattice reduction attains the maximum receive diversity order of the channel;
simulation results show that it significantly outperforms LLL-SIC detection without entailing any
additional complexity. A theoretical bound on the complexity is also derived.

Index Terms: lattice reduction-aided decoding, LLL algorithm, right preprocessing. 

I. Introduction 

Multiple-input multiple-output (MIMO) systems can provide high data rates and reliability over 
fading channels. In order to achieve optimal performance, maximum likelihood decoders such as 
the Sphere Decoder may be employed; however, their complexity grows prohibitively with the 
number of antennas and the constellation size, posing a challenge for practical implementation. 
On the other hand, suboptimal receivers such as zero forcing (ZF) or successive interference
cancellation (SIC) do not preserve the diversity order of the system [11]. Right preprocessing
using lattice reduction has been proposed in order to enhance their performance.

In particular, the classical LLL algorithm for lattice reduction, whose average complexity is
polynomial in the number of antennas^1, has been proven to achieve the optimal receive diversity
order in the spatial multiplexing case [17]. Very recently, it has also been shown that combined
with regularization techniques such as MMSE-GDFE left preprocessing, lattice reduction-aided
decoding is optimal in terms of diversity-multiplexing tradeoff [8]. However, the gap between
the error probability of ML detection and LLL-ZF (respectively, LLL-SIC) detection increases
greatly for a large number of antennas [13].

In this paper we present a new kind of LLL-aided decoding, called augmented lattice reduction,
which does not require ZF or SIC receivers and therefore does not entail the computation of the
pseudo-inverse of the channel matrix or its QR decomposition.

In the coherent case, MIMO decoding amounts to solving an instance of the closest vector
problem (CVP) in a finite subset of the lattice generated by the channel matrix^2. Following an
idea of Kannan [10], our strategy is to reduce the CVP to the shortest vector problem (SVP) by
embedding the n-dimensional lattice generated by the channel matrix into an (n+1)-dimensional
lattice. We show that for a suitable choice of the embedding, the transmitted message can be
recovered directly from the coordinates of the shortest vector of the augmented lattice.
In general, the LLL algorithm is not guaranteed to solve the SVP; however, it certainly finds the 
shortest vector in the lattice in the particular case where the minimum distance is exponentially 
smaller than the other successive minima. Equivalently, we can say that "the LLL algorithm 
is an SVP-oracle when the lattice gap is exponential in the lattice dimension". An appropriate 
choice of the embedding ensures that this condition is satisfied. 

Thanks to this property, we can prove that our method also achieves the receive diversity
of the channel. Numerical simulations show that augmented lattice reduction significantly
outperforms LLL-SIC detection without entailing any additional complexity. A theoretical (albeit
pessimistic) bound on the complexity is also derived.

^1 Note that the worst-case number of iterations of the LLL algorithm applied to the MIMO context
has been proved to be unbounded. However, the tail probability of the number of iterations decays
exponentially, so that in many cases high complexity events can be regarded as negligible with
respect to the target error rate (see [8], Theorem 3).
^2 Actually, LLL-ZF and LLL-SIC suboptimal decoding correspond to two classical techniques for
finding approximate solutions of the CVP, due to Babai: the rounding algorithm and the nearest
plane algorithm respectively [1].
This paper is organized as follows: in Section II we introduce the system model and basic
notions concerning lattice reduction, and summarize the existing lattice reduction-aided decoding
schemes. In Section III we describe augmented lattice reduction decoding, and in Section IV we
analyze its performance and complexity, both theoretically and through numerical simulations.



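II. Preliminaries

A. System model and notation

We consider a MIMO system with $M$ transmit and $N$ receive antennas such that $M \le N$,
using spatial multiplexing. The complex received signal is given by

$$y_c = H_c x_c + w_c, \tag{1}$$

where $x_c \in \mathbb{C}^M$, $y_c, w_c \in \mathbb{C}^N$, $H_c \in M_{N \times M}(\mathbb{C})$. The
transmitted vector $x_c$ belongs to a finite constellation $S \subset \mathbb{Z}[i]^M$; the entries
of the channel matrix $H_c$ are supposed to be i.i.d. complex Gaussian random variables with zero
mean and variance per real dimension equal to $1/2$, and $w_c$ is the Gaussian noise with i.i.d.
entries of zero mean and variance $N_0$. We consider the coherent case where $H_c$ is known at the
receiver.

Separating the real and imaginary part, the model can be rewritten as

$$y = Hx + w \tag{2}$$

in terms of the real-valued vectors

$$y = \begin{pmatrix} \Re(y_c) \\ \Im(y_c) \end{pmatrix} \in \mathbb{R}^n, \quad
x = \begin{pmatrix} \Re(x_c) \\ \Im(x_c) \end{pmatrix} \in \mathbb{Z}^m, \quad
w = \begin{pmatrix} \Re(w_c) \\ \Im(w_c) \end{pmatrix} \in \mathbb{R}^n$$

and of the equivalent real channel matrix

$$H = \begin{pmatrix} \Re(H_c) & -\Im(H_c) \\ \Im(H_c) & \Re(H_c) \end{pmatrix} \in M_{n \times m}(\mathbb{R}).$$

Here $n = 2N$, $m = 2M$.

The maximum likelihood decoded vector is given by

$$\hat{x}_{ML} = \arg\min_{x_c \in S} \|H_c x_c - y_c\| = \arg\min_{x} \|Hx - y\|,$$

where the second minimum is taken over the real representations $x$ of the constellation points
and $\|\cdot\|$ denotes the Euclidean norm.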
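The passage from the complex model (1) to the real model (2) can be illustrated with a short
Python sketch. The helper names `to_real_model` and `to_real_vector` are hypothetical and not part
of the paper; the stacking convention (real parts on top of imaginary parts) is the standard one
and is assumed here, and the noise is set to zero only to isolate the mapping.

```python
import numpy as np

def to_real_model(Hc, yc):
    """Map the complex model y_c = H_c x_c + w_c to the real model y = H x + w."""
    H = np.block([[Hc.real, -Hc.imag],
                  [Hc.imag,  Hc.real]])      # shape (2N, 2M)
    y = np.concatenate([yc.real, yc.imag])   # shape (2N,)
    return H, y

def to_real_vector(xc):
    """Stack real and imaginary parts of a complex vector."""
    return np.concatenate([xc.real, xc.imag])

rng = np.random.default_rng(0)
N, M = 3, 2
# i.i.d. complex Gaussian entries with variance 1/2 per real dimension
Hc = (rng.standard_normal((N, M)) + 1j * rng.standard_normal((N, M))) / np.sqrt(2)
xc = np.array([1 - 2j, -3 + 1j])             # Gaussian-integer symbols (illustrative)
yc = Hc @ xc                                 # noiseless, to isolate the mapping
H, y = to_real_model(Hc, yc)
x = to_real_vector(xc)
```

Since the mapping preserves Euclidean norms, the complex and real ML metrics coincide, which is
why the two minimizations above select the same point.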
B. Lattice reduction

An $m$-dimensional real lattice in $\mathbb{R}^n$ is the set of points

$$\mathcal{L}(H) = \{Hx \mid x \in \mathbb{Z}^m\},$$

where $H \in M_{n \times m}(\mathbb{R})$. We denote by $d_H$ the minimum distance of the lattice,
that is the smallest norm of a nonzero vector in $\mathcal{L}(H)$. More generally, for all
$1 \le i \le m$ one can define the $i$-th successive minimum of the lattice as follows:

$$\lambda_i(H) = \inf\{r > 0 \mid \exists\, v_1, \ldots, v_i \text{ linearly independent in } \mathcal{L}(H) \text{ s.t. } \|v_j\| \le r \ \forall j \le i\}.$$

We recall that two matrices $H$, $H'$ generate the same lattice if and only if $H' = HU$ with $U$
unimodular.

Lattice reduction algorithms make it possible to find a new basis $H'$ for a given lattice such
that the basis vectors are shorter and nearly orthogonal. Orthogonality can be measured by the
absolute value of the coefficients $\mu_{i,j}$ in the Gram-Schmidt orthogonalization of the basis,
see the GSO Algorithm 1.

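Algorithm 1: GSO (Gram-Schmidt orthogonalization)
$h_1^* \leftarrow h_1$
for $i = 2, \ldots, m$ do
    for $j = 1, \ldots, i-1$ do
        $\mu_{i,j} \leftarrow \langle h_i, h_j^* \rangle / \|h_j^*\|^2$
    end
    $h_i^* \leftarrow h_i - \sum_{j=1}^{i-1} \mu_{i,j} h_j^*$
end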



We recall the following useful property of GSO: the length of the smallest of the Gram-Schmidt
vectors $h_i^*$ is always less than or equal to the minimum distance $d_H$ of the lattice [15]. In
other words,

$$d_H \ge a(H) = \min_{1 \le i \le m} \|h_i^*\|. \tag{3}$$
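As an illustration of Algorithm 1 and of property (3), the following Python sketch computes the
Gram-Schmidt vectors of a small basis and compares $a(H)$ with the norms of lattice points
enumerated in a small box. The function name `gso`, the example matrix and the enumeration range
are illustrative assumptions; the enumeration only samples the lattice, it does not compute $d_H$
in general.

```python
import itertools
import numpy as np

def gso(H):
    """Gram-Schmidt orthogonalization of the columns of H (no normalization).

    Returns (Hs, mu): the columns of Hs are the vectors h*_i, and mu[i, j] is
    <h_i, h*_j> / ||h*_j||^2 for j < i (mu has a unit diagonal).
    """
    H = np.asarray(H, dtype=float)
    m = H.shape[1]
    Hs = H.copy()
    mu = np.eye(m)
    for i in range(1, m):
        for j in range(i):
            mu[i, j] = H[:, i] @ Hs[:, j] / (Hs[:, j] @ Hs[:, j])
            Hs[:, i] -= mu[i, j] * Hs[:, j]
    return Hs, mu

H = np.array([[1.0, 1.0, 1.0],
              [-1.0, 0.0, 2.0],
              [3.0, 5.0, 6.0]])              # arbitrary full-rank example basis
Hs, mu = gso(H)
a_H = np.linalg.norm(Hs, axis=0).min()       # a(H) = min_i ||h*_i||

# Norm of the shortest nonzero lattice point with coefficients in {-3, ..., 3}^3
shortest_in_box = min(
    np.linalg.norm(H @ np.array(c))
    for c in itertools.product(range(-3, 4), repeat=3) if any(c)
)
```

Every nonzero lattice point has norm at least $a(H)$, so `shortest_in_box` can never fall below
`a_H`, in agreement with (3).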
A basis $H$ is said to be LLL-reduced [14] if its Gram-Schmidt coefficients $\mu_{i,j}$ and
Gram-Schmidt vectors satisfy the following properties:

1) Size reduction:

$$|\mu_{k,l}| \le \frac{1}{2}, \quad 1 \le l < k \le m,$$

2) Lovasz condition:

$$\|h_k^* + \mu_{k,k-1} h_{k-1}^*\|^2 \ge \delta \|h_{k-1}^*\|^2, \quad 1 < k \le m,$$

where $\delta \in (\frac{1}{4}, 1)$ (a customary choice is $\delta = \frac{3}{4}$).

The LLL algorithm is summarized in Algorithm 2. Given a full-rank matrix
$H \in M_{n \times m}(\mathbb{R})$, it computes an LLL-reduced version $H_{red} = HU$, with
$U \in M_{m \times m}(\mathbb{Z})$ unimodular, and outputs the columns $\{h_i\}$ and $\{u_i\}$ of
$H_{red}$ and $U$ respectively.

We list here some properties of LLL-reduced bases that we will need in the sequel. First of all,
the LLL algorithm finds at least one basis vector whose length is not too far from the minimum
distance $d_H$ of the lattice. The following inequality holds for any $m$-dimensional LLL-reduced
basis $H$:

$$\|h_1\| \le \alpha^{\frac{m-1}{2}} d_H, \tag{4}$$

where $\alpha = \frac{1}{\delta - 1/4}$ ($\alpha = 2$ if $\delta = \frac{3}{4}$).

Moreover, the first basis vector cannot be too big compared to the Gram-Schmidt vectors $\{h_i^*\}$:

$$\|h_1\| \le \alpha^{\frac{i-1}{2}} \|h_i^*\|, \quad \forall\, 1 \le i \le m.$$

In particular, if $j = \arg\min_{1 \le i \le m} \|h_i^*\|$,

$$d_H \le \|h_1\| \le \alpha^{\frac{j-1}{2}} \|h_j^*\| = \alpha^{\frac{j-1}{2}} a(H) \le \alpha^{\frac{m-1}{2}} a(H). \tag{5}$$



C. Lattice reduction-aided decoding 

In this section we briefly review existing detection schemes which use the LLL algorithm to
preprocess the channel matrix, in order to improve the performance of suboptimal decoders such
as ZF or SIC.

Let $H_{red} = HU$ be the output of the LLL algorithm on $H$. We can rewrite the received vector
as $y = H_{red} U^{-1} x + w$.
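Algorithm 2: The LLL algorithm
$U \leftarrow I_m$
Compute the GSO of $H$
$k \leftarrow 2$
while $k \le m$ do
    RED(k, k-1)
    if $\|h_k^* + \mu_{k,k-1} h_{k-1}^*\|^2 < \delta \|h_{k-1}^*\|^2$ then
        swap $h_k$ and $h_{k-1}$
        swap $u_k$ and $u_{k-1}$
        update GSO
        $k \leftarrow \max(k-1, 2)$
    else
        for $l = k-2, \ldots, 1$ do
            RED(k, l)
        end
        $k \leftarrow k+1$
    end
end

Algorithm 3: Size reduction RED(k, l)
if $|\mu_{k,l}| > \frac{1}{2}$ then
    $r \leftarrow \lfloor \mu_{k,l} \rceil$
    $h_k \leftarrow h_k - r\, h_l$
    $u_k \leftarrow u_k - r\, u_l$
    for $j = 1, \ldots, l-1$ do
        $\mu_{k,j} \leftarrow \mu_{k,j} - r\, \mu_{l,j}$
    end
    $\mu_{k,l} \leftarrow \mu_{k,l} - r$
end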
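Algorithms 2 and 3 can be sketched in Python as follows. This is a deliberately simple textbook
variant, not the authors' implementation: it recomputes the full Gram-Schmidt orthogonalization at
every step instead of updating it, and it fully size-reduces column $k$ before testing the Lovasz
condition. The names `gso` and `lll` and the example basis are assumptions made for illustration.

```python
import numpy as np

def gso(B):
    """Gram-Schmidt vectors (columns of Bs) and coefficients mu of the columns of B."""
    m = B.shape[1]
    Bs, mu = B.astype(float), np.eye(m)
    for i in range(1, m):
        for j in range(i):
            mu[i, j] = B[:, i] @ Bs[:, j] / (Bs[:, j] @ Bs[:, j])
            Bs[:, i] -= mu[i, j] * Bs[:, j]
    return Bs, mu

def lll(H, delta=0.75):
    """Return (H_red, U) with H_red = H @ U LLL-reduced and U unimodular (0-based indices)."""
    B = np.array(H, dtype=float)
    m = B.shape[1]
    U = np.eye(m, dtype=int)
    k = 1
    while k < m:
        # Size-reduce column k against columns k-1, ..., 0
        for l in range(k - 1, -1, -1):
            _, mu = gso(B)
            r = int(np.rint(mu[k, l]))
            if r != 0:
                B[:, k] -= r * B[:, l]
                U[:, k] -= r * U[:, l]
        Bs, mu = gso(B)
        prev = Bs[:, k - 1] @ Bs[:, k - 1]
        # ||h*_k + mu_{k,k-1} h*_{k-1}||^2, using orthogonality of the two terms
        lhs = Bs[:, k] @ Bs[:, k] + mu[k, k - 1] ** 2 * prev
        if lhs < delta * prev:               # Lovasz condition violated: swap and step back
            B[:, [k - 1, k]] = B[:, [k, k - 1]]
            U[:, [k - 1, k]] = U[:, [k, k - 1]]
            k = max(k - 1, 1)
        else:
            k += 1
    return B, U

H = np.array([[1.0, 1.0, 1.0],
              [-1.0, 0.0, 2.0],
              [3.0, 5.0, 6.0]])              # arbitrary full-rank example basis
H_red, U = lll(H)
```

Recomputing the GSO keeps the sketch short at the price of extra operations; an implementation
aimed at the complexity figures discussed in Section V would update the coefficients in place as
in Algorithm 3.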
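• The LLL-ZF decoder outputs

$$\hat{x}_{LLL-ZF} = Q_S\left(U \left\lfloor H_{red}^{\dagger}\, y \right\rceil\right),$$

where $H_{red}^{\dagger} = (H_{red}^T H_{red})^{-1} H_{red}^T$ is the Moore-Penrose pseudoinverse of
$H_{red}$, $\lfloor \cdot \rceil$ denotes componentwise rounding to the nearest integer and $Q_S$ is
a quantization function that forces the solution to belong to the constellation $S$.
• The LLL-SIC decoder performs the QR decomposition $H_{red} = QR$, computes $\tilde{y} = Q^T y$,
finds by recursion $\tilde{x}$ defined by

$$\tilde{x}_m = \left\lfloor \frac{\tilde{y}_m}{r_{m,m}} \right\rceil, \qquad
\tilde{x}_i = \left\lfloor \frac{\tilde{y}_i - \sum_{j=i+1}^{m} r_{i,j}\, \tilde{x}_j}{r_{i,i}} \right\rceil, \quad i = m-1, \ldots, 1,$$

and finally outputs $\hat{x}_{LLL-SIC} = Q_S(U\tilde{x})$.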
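The two decoders above can be sketched as follows, given a reduced basis and its change of basis
matrix. This is a minimal illustration under stated assumptions: the quantization $Q_S$ is omitted
(the constellation is treated as all of $\mathbb{Z}^m$), the function names `lll_zf` and `lll_sic`
are hypothetical, and the pair `(H_red, U)` in the example is hand-made rather than produced by an
LLL routine.

```python
import numpy as np

def lll_zf(H_red, U, y):
    """Rounding in the reduced basis, mapped back through U (Q_S omitted)."""
    z = np.rint(np.linalg.pinv(H_red) @ y)
    return np.rint(U @ z).astype(int)

def lll_sic(H_red, U, y):
    """Successive interference cancellation in the reduced basis (Q_S omitted)."""
    Q, R = np.linalg.qr(H_red)               # reduced QR: Q is n x m, R is m x m
    y_t = Q.T @ y
    m = R.shape[1]
    z = np.zeros(m)
    for i in range(m - 1, -1, -1):           # back-substitution with rounding
        z[i] = np.rint((y_t[i] - R[i, i + 1:] @ z[i + 1:]) / R[i, i])
    return np.rint(U @ z).astype(int)

# Hand-made example: a nearly orthogonal "reduced" basis and a unimodular U
H_red = np.array([[2.0, 0.1, 0.0],
                  [0.0, 2.0, 0.1],
                  [0.1, 0.0, 2.0],
                  [0.0, 0.1, 0.0]])
U = np.array([[1, 2, 0],
              [0, 1, 3],
              [0, 0, 1]])
H = H_red @ np.linalg.inv(U)                 # channel matrix such that H_red = H U
x = np.array([1, -2, 3])
w = 0.01 * np.array([1.0, -1.0, 1.0, 1.0])   # small deterministic perturbation
y = H @ x + w
x_zf = lll_zf(H_red, U, y)
x_sic = lll_sic(H_red, U, y)
```

Both functions first estimate $U^{-1}x$ in the reduced basis and then multiply by $U$, which is the
step the augmented approach of Section III avoids.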



III. Augmented Lattice Reduction 

We propose here a new decoding technique based on the LLL algorithm which, unlike the
LLL-ZF and LLL-SIC decoders, does not require the inversion of the channel matrix at the last
stage. Let $y$ be the (real) received vector in the model (2). Consider the $(n+1) \times (m+1)$
augmented matrix

$$\tilde{H} = \begin{pmatrix} H & -y \\ 0_{1 \times m} & t \end{pmatrix}, \tag{6}$$

where $t > 0$ is a parameter to be determined. The points of the augmented lattice
$\mathcal{L}(\tilde{H})$ are of the form

$$\begin{pmatrix} Hx' - qy \\ qt \end{pmatrix}, \quad x' \in \mathbb{Z}^m,\ q \in \mathbb{Z}.$$

In particular, the vector

$$v = \begin{pmatrix} Hx - y \\ t \end{pmatrix} = \begin{pmatrix} -w \\ t \end{pmatrix}$$

belongs to the augmented lattice. We will show that for a suitable choice of the parameter $t$,
and supposing that the noise is small enough, $v$ is the shortest vector in the lattice and the
LLL algorithm finds this vector. That is, $\pm v$ is the first column of
$\tilde{H}_{red} = \tilde{H}\tilde{U}$, the output of the LLL algorithm on $\tilde{H}$. Clearly,
since $\tilde{H}$ is full-rank with probability 1, in this case the first column of the change of
basis matrix $\tilde{U}$ is $\pm \begin{pmatrix} x \\ 1 \end{pmatrix}$. Thus we can "read" the
transmitted message directly from the change of basis matrix $\tilde{U}$.

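To summarize, in order to decode we can perform the LLL algorithm on $\tilde{H}$, and given the
output $\tilde{H}_{red} = \tilde{H}\tilde{U}$, we can choose

$$\hat{x} = \frac{1}{\tilde{u}_{m+1,1}} \left(\tilde{u}_{1,1}, \ldots, \tilde{u}_{m,1}\right)^T, \tag{7}$$

where $\tilde{U} = (\tilde{u}_{i,j})$.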

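The previous decoder can be improved by including all the columns of $\tilde{H}_{red}$ in the
search for the vector $v$. Specifically, let

$$\hat{u}_k = \frac{1}{\tilde{u}_{m+1,k}} \left(\tilde{u}_{1,k}, \ldots, \tilde{u}_{m,k}\right)^T, \quad k = 1, \ldots, m+1.$$

If there exists some $k \in \{1, \ldots, m+1\}$ such that $|\tilde{u}_{m+1,k}| = 1$, we define

$$k_{min} = \arg\min_{k \text{ s.t. } |\tilde{u}_{m+1,k}| = 1} \|H\hat{u}_k - y\|,$$

otherwise $k_{min} = 1$. Then the Augmented Lattice Reduction decoder outputs

$$\hat{x}_{ALR} = Q_S\left(\lfloor \hat{u}_{k_{min}} \rceil\right). \tag{8}$$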
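The decoder (8) can be sketched end to end in Python. This is an illustration under several
assumptions rather than the authors' implementation: the quantization $Q_S$ is omitted, the value
$\varepsilon = 1/(2\sqrt{2}\alpha^m)$ is taken as the choice of Proposition 1, the LLL routine is
the simple textbook variant (full Gram-Schmidt recomputation), and $\tilde{H}$ is reduced from
scratch instead of continuing the reduction of $H$. All function names are hypothetical, and the
guard used when no admissible column exists and $\tilde{u}_{m+1,1} = 0$ is a sketch-only choice.

```python
import numpy as np

def gso(B):
    """Gram-Schmidt vectors (columns of Bs) and coefficients mu of the columns of B."""
    m = B.shape[1]
    Bs, mu = B.astype(float), np.eye(m)
    for i in range(1, m):
        for j in range(i):
            mu[i, j] = B[:, i] @ Bs[:, j] / (Bs[:, j] @ Bs[:, j])
            Bs[:, i] -= mu[i, j] * Bs[:, j]
    return Bs, mu

def lll(H, delta=0.75):
    """Textbook LLL: returns (H_red, U) with H_red = H @ U."""
    B = np.array(H, dtype=float)
    m = B.shape[1]
    U = np.eye(m, dtype=int)
    k = 1
    while k < m:
        for l in range(k - 1, -1, -1):       # size reduction of column k
            r = int(np.rint(gso(B)[1][k, l]))
            if r != 0:
                B[:, k] -= r * B[:, l]
                U[:, k] -= r * U[:, l]
        Bs, mu = gso(B)
        prev = Bs[:, k - 1] @ Bs[:, k - 1]
        if Bs[:, k] @ Bs[:, k] + mu[k, k - 1] ** 2 * prev < delta * prev:
            B[:, [k - 1, k]] = B[:, [k, k - 1]]
            U[:, [k - 1, k]] = U[:, [k, k - 1]]
            k = max(k - 1, 1)
        else:
            k += 1
    return B, U

def alr_parameter(H, delta=0.75):
    """Return (eps, t) with t = eps * a(H_red); eps = 1/(2 sqrt(2) alpha^m) is assumed."""
    alpha = 1.0 / (delta - 0.25)
    eps = 1.0 / (2.0 * np.sqrt(2.0) * alpha ** H.shape[1])
    H_red, _ = lll(H, delta)
    a_red = np.linalg.norm(gso(H_red)[0], axis=0).min()
    return eps, eps * a_red

def alr_decode(H, y, delta=0.75):
    """Augmented lattice reduction decoder following (6)-(8), without Q_S."""
    m = H.shape[1]
    _, t = alr_parameter(H, delta)
    H_aug = np.block([[H, -y[:, None]],
                      [np.zeros((1, m)), np.full((1, 1), t)]])
    _, U = lll(H_aug, delta)
    best, best_dist = None, np.inf
    for k in range(m + 1):                   # search all columns with last entry +-1
        if abs(U[m, k]) == 1:
            u_hat = U[:m, k] / U[m, k]
            dist = np.linalg.norm(H @ u_hat - y)
            if dist < best_dist:
                best, best_dist = u_hat, dist
    if best is None:                         # otherwise k_min = 1 (first column)
        q = U[m, 0] if U[m, 0] != 0 else 1   # guard against q = 0: sketch-only choice
        best = U[:m, 0] / q
    return np.rint(best).astype(int)

rng = np.random.default_rng(1)
H = rng.standard_normal((4, 4))
x = np.array([2, -1, 0, 3])
eps, t = alr_parameter(H)
w = rng.standard_normal(4)
w *= 0.5 * t / np.linalg.norm(w)             # ||w|| = eps * a(H_red) / 2 <= eps * d_H
y = H @ x + w
x_hat = alr_decode(H, y)
```

The noise in the example is scaled so that $\|w\| \le \varepsilon d_H$; at realistic noise levels
this condition fails for most channel realizations and the decoder has no such guarantee.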

IV. Performance 

A. Diversity 

In this paragraph we will investigate the performance of augmented lattice reduction. We begin
by proving that our method, like LLL-ZF and LLL-SIC, attains the maximum receive diversity
gain of $N$, for an appropriate choice of the parameter $t$ in (6). The diversity gain $d$ of a
decoding scheme is defined as follows:

$$d = -\lim_{\rho \to \infty} \frac{\log(P_e)}{\log(\rho)},$$

where $P_e$ denotes the error probability as a function of the signal to noise ratio $\rho$.

Proposition 1. If the augmented lattice reduction is performed using $t = \varepsilon\, a(H_{red})$,
where $a(H_{red})$ is the length of the smallest vector in the Gram-Schmidt orthogonalization of
$H_{red}$, and $\varepsilon \le \frac{1}{2\sqrt{2}\,\alpha^m}$, then it achieves the maximum receive
diversity $N$.
Remark. It is essential to use $a(H_{red})$ in place of $a(H)$. In fact, for general bases $H$ that
are not LLL-reduced, there is no lower bound of the type (5) limiting how small the smallest
Gram-Schmidt vector can be. For $a(H_{red})$, putting together the bounds (3) and (5), we obtain

$$\frac{d_H}{\alpha^{\frac{m-1}{2}}} \le a(H_{red}) \le d_H. \tag{9}$$

Note that the LLL reduction of $H$ does not entail any additional complexity, since it is the same
as the LLL reduction on the first $m$ columns of $\tilde{H}$. In fact the parameter $t$ can be
chosen during the LLL reduction of $\tilde{H}$, after carrying out the LLL algorithm on the first
$m$ columns.

In order to prove the previous Proposition, we will show that in the $(m+1)$-dimensional
lattice $\mathcal{L}(\tilde{H})$ there is an exponential gap between the first two successive
minima. Then, using the estimate (4) on the norm of the first vector in an LLL-reduced basis, one
can conclude that in this particular case the LLL algorithm finds the shortest vector in the
lattice $\mathcal{L}(\tilde{H})$ with high probability. This, in turn, makes it possible to recover
the closest lattice vector $Hx$ to $y$ in $\mathcal{L}(H)$, supposing that the noise $w$ is small
enough.

The following definition makes the notion of "gap" more precise: 

Definition. Let $v$ be a shortest nonzero vector in the lattice $\mathcal{L}(H)$, and let
$\gamma > 1$. $v$ is called $\gamma$-unique if $\forall u \in \mathcal{L}(H)$,

$$\|u\| \le \gamma \|v\| \ \Rightarrow\ u, v \text{ are linearly dependent.}$$

We now prove the existence of such a gap under suitable conditions:

Lemma 1. Let $\tilde{H}$ be the matrix defined in (6), and let $t = \varepsilon\, a(H_{red})$, with
$\varepsilon \le \frac{1}{2\sqrt{2}\,\alpha^m}$. Suppose that $\|w\| = \|y - Hx\| \le \varepsilon d_H$.
Then $v = \begin{pmatrix} Hx - y \\ t \end{pmatrix}$ is an $\alpha^{\frac{m}{2}}$-unique shortest
vector of $\mathcal{L}(\tilde{H})$.

Remark. Observe that the hypothesis on $\|w\|$ implies in particular that $\|w\| < \frac{d_H}{2}$
and $Hx$ is indeed the closest lattice point to $y$.

Proof: We need to show that any vector $u \in \mathcal{L}(\tilde{H})$ that is not a multiple of $v$
must have length greater than $\alpha^{\frac{m}{2}} \|v\|$.

By contradiction, suppose that there exists
$u = \begin{pmatrix} Hx' - qy \\ qt \end{pmatrix} \in \mathcal{L}(\tilde{H})$ linearly independent
from $v$ such that $\|u\| \le \alpha^{\frac{m}{2}} \|v\|$. Since $\|u\| \ge |q|\, t$,

$$|q| \le \frac{\|u\|}{t} \le \frac{\alpha^{\frac{m}{2}} \|v\|}{t}.$$

On the other side, $\|u\| \le \alpha^{\frac{m}{2}} \|v\|$ implies that also
$\|Hx' - qy\| \le \alpha^{\frac{m}{2}} \|v\|$. Consider

$$\|Hx' - qHx\| \le \|Hx' - qy\| + \|qy - qHx\| \le \alpha^{\frac{m}{2}} \|v\| + |q|\, \|y - Hx\|
\le \alpha^{\frac{m}{2}} \|v\| + \frac{\alpha^{\frac{m}{2}} \|v\|}{t} \|w\|
= \alpha^{\frac{m}{2}} \sqrt{\|w\|^2 + t^2} \left(1 + \frac{\|w\|}{t}\right). \tag{10}$$

The bound (9) on $a(H_{red})$ implies

$$\frac{\varepsilon}{\alpha^{\frac{m-1}{2}}}\, d_H \le t \le \varepsilon d_H.$$

Using this inequality and the hypotheses on $\|w\|$ and $\varepsilon$, we can bound the expression
(10) with

$$\alpha^{\frac{m}{2}} \sqrt{2}\, \varepsilon d_H \left(1 + \alpha^{\frac{m-1}{2}}\right)
\le 2\sqrt{2}\, \varepsilon\, d_H\, \alpha^{\frac{m}{2}} \alpha^{\frac{m-1}{2}} < d_H.$$

Thus $\|Hx' - qHx\| < d_H$. But this is a contradiction because $Hx' - qHx \in \mathcal{L}(H)$ and
is nonzero since $v$ and $u$ are linearly independent. Therefore $v$ is
$\alpha^{\frac{m}{2}}$-unique. (Since the last coordinate of $v$ in the basis $\tilde{H}$ is 1, $v$
cannot be a nontrivial multiple of another lattice vector.) □
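The last chain of inequalities in the proof can be checked numerically. The sketch below assumes
the choice $\varepsilon = 1/(2\sqrt{2}\alpha^m)$ and evaluates the factor multiplying $d_H$ in the
bound on (10), namely $\alpha^{m/2}\sqrt{2}\,\varepsilon\,(1 + \alpha^{(m-1)/2})$, for several
dimensions; the function names are hypothetical.

```python
import math

def alpha_const(delta=0.75):
    """alpha = 1 / (delta - 1/4), as in property (4)."""
    return 1.0 / (delta - 0.25)

def eps_max(m, delta=0.75):
    """Assumed largest admissible epsilon: 1 / (2 sqrt(2) alpha^m)."""
    return 1.0 / (2.0 * math.sqrt(2.0) * alpha_const(delta) ** m)

def gap_bound_factor(m, eps, delta=0.75):
    """Upper bound on ||Hx' - qHx|| / d_H obtained from (10) and (9)."""
    a = alpha_const(delta)
    return a ** (m / 2) * math.sqrt(2.0) * eps * (1.0 + a ** ((m - 1) / 2))

factors = {m: gap_bound_factor(m, eps_max(m)) for m in (2, 4, 8, 12, 16)}
```

With this choice the factor simplifies to $\frac{1}{2}\left(\alpha^{-m/2} + \alpha^{-1/2}\right)$,
which stays below 1 for every $m$ because $\alpha > 1$; this is what yields the strict inequality
$\|Hx' - qHx\| < d_H$.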

Remark. The lower bound on $t$ is essential to ensure that $|q|$ is bounded. If $|q|$ were
unbounded, clearly $\|Hx' - qy\|$ might be arbitrarily small and there might exist
$u \in \mathcal{L}(\tilde{H})$ of smaller norm than $v$.

Lemma 2. Under the hypotheses of Lemma 1, the augmented lattice reduction methods (7) and
(8) correctly decode the transmitted signal $x$.

Proof: Let $\tilde{H}_{red} = \tilde{H}\tilde{U}$ denote the output of the LLL reduction of
$\tilde{H}$, and let $\tilde{h}_1 = \begin{pmatrix} Hx' - qy \\ qt \end{pmatrix}$ be its first
column. The property (4) of LLL reduction in dimension $m+1$ entails that

$$\|\tilde{h}_1\| \le \alpha^{\frac{m}{2}} d_{\tilde{H}} = \alpha^{\frac{m}{2}} \|v\|.$$

By Lemma 1, this means that $\tilde{h}_1$ and $v$ are linearly dependent; equivalently,
$\exists\, a, b \in \mathbb{Z} \setminus \{0\}$ such that $av + b\tilde{h}_1 = 0$.
In particular $at + bqt = 0$, that is $a = -bq$ and $\tilde{h}_1 = qv$. Then by definition of
$\tilde{H}$,

$$\tilde{h}_1 = \tilde{H} \begin{pmatrix} qx \\ q \end{pmatrix}.$$

This means that the first column of the reduction matrix $\tilde{U}$ is
$\begin{pmatrix} qx \\ q \end{pmatrix}$, and so
$\hat{x}_{ALR} = Q_S(\lfloor \hat{u}_1 \rceil) = Q_S(qx/q) = x$ and the augmented lattice reduction
methods (7) and (8) correctly decode the transmitted message.

(Observe that this is possible only if $|q| = 1$, since $\det(\tilde{U})$ is also a multiple of $q$
and $\tilde{U}$ is unimodular.) □

Thus for any channel realization $H$, we have the following bound on the error probability for
the augmented lattice reduction method:

$$P_{e,ALR}(H) \le P\{\|w\| > \varepsilon d_H\}.$$

To conclude the proof of Proposition 1 we need to show that given
$\varepsilon \le \frac{1}{2\sqrt{2}\,\alpha^m}$ we have

$$\lim_{\rho \to \infty} \frac{-\log P\{\|w\| > \varepsilon d_H\}}{\log \rho} \ge N.$$

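This turns out to be true. In fact, it has been shown in [17] (Proof of Theorem 2), that for any
constant $c_M$ depending only on the number of transmit antennas^3, there is a constant $c$ such that

$$P\{\|w\| > c_M d_H\} \le \frac{c\, (\ln \rho)^{N+1}}{\rho^N} \quad \text{for } N = M,$$

$$P\{\|w\| > c_M d_H\} \le \frac{c}{\rho^N} \quad \text{for } N > M.$$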

Thus we have shown that augmented lattice reduction achieves the maximum receive diversity
with the choice $t = \varepsilon\, a(H_{red})$.

^3 This result was used in [17] in order to prove that the LLL-ZF decoder achieves the receive
diversity order. The proof in [17] actually refers to the complex model (1), but the statement also
holds for the real model since $d_{H_c} = d_H$, $\|w\| = \|w_c\|$.
B. Simulation results

Figure 1 shows the performance of augmented lattice reduction for an uncoded 6x6 MIMO
system using 16-QAM constellations.

Two versions of augmented lattice reduction with different values of the parameter $\varepsilon$
are compared. Clearly it is preferable to choose $\varepsilon$ as big as possible in order to
minimize the probability $P\{\|w\| > \varepsilon d_H\}$. Version 1 corresponds to the choice
$\varepsilon = \frac{1}{2\sqrt{2}\,\alpha^m}$, the highest value of $\varepsilon$ that
verifies the hypothesis of Proposition 1. At the SER of 2 • 10"^, its performance is within 2.5 dB
of ML decoding and gains 1.5 dB with respect to LLL-SIC decoding.

Version 2 corresponds to a value of $\varepsilon$ optimized by computer search (experimentally,
this is around 2^^), whose performance is within 2.2 dB of ML decoding at the SER of 2 ■ 10^"^. From
now on, we will always consider this optimized version. For higher values of $\varepsilon$, we are
not able to prove that the LLL algorithm finds the shortest lattice vector in
$\mathcal{L}(\tilde{H})$. However, it is well-known that the LLL algorithm performs much better on
average than the theoretical bounds predict.

In order to further reduce the distance from ML decoding, one can add MMSE-GDFE preprocessing,
which yields a better conditioned channel matrix. Figure 2 shows the comparison of
augmented lattice reduction with LLL-SIC detection, both using MMSE-GDFE preprocessing.
At the SER of lO^*^, augmented lattice reduction is within only 0.4 dB of ML performance
and gains 2.3 dB with respect to LLL-SIC decoding.

The gain with respect to LLL-SIC decoding increases with the number of antennas: it is 3.5 dB
for an 8 x 8 MIMO system, at the SER of 10~^. On the other side, augmented lattice reduction
is still within 0.8 dB of ML performance (see Figure 3).

C. Comparison with Kim and Park's "Improved Lattice Reduction" 

A lattice-reduction aided detection technique based on an augmented matrix similar to (6)
(after MMSE-GDFE preprocessing) has been proposed in [12]. However, the philosophy behind
the method of [12] is quite different: the parameter $t$ is chosen in such a way that the Lovasz
condition on the last column of the augmented matrix is always verified. Specifically, considering
the QR decomposition $H_{red} = QR$ of the LLL-reduced matrix $H_{red}$, the condition
$t > r_{m,m}$ is required. In general, this results in a much bigger value of the parameter $t$.
Thus the transmitted message is detected from the last vector of the reduced augmented basis
instead of the smallest basis vector.

On one side, this guarantees that the complexity increase is trivial because the only step required
after reducing $H$ is size reduction on the last column. On the other side, unlike our exponential
gap technique, there is no guarantee that LLL reduction can find the required lattice vector. As
a consequence, the performance of the decoder described in [12] is not as good, especially as
the number of antennas increases; in fact it is about the same as LLL-SIC (see Figure 2). The
authors then propose to use a quantization error correction to improve the performance, which
requires an additional computational cost, and is not needed in our case.



V. Complexity 

In this section we propose to estimate the additional complexity required by augmented lattice 
reduction with respect to LLL-ZF and LLL-SIC decoding. We are interested in the complexity 
order as a function of the number of transmit and receive antennas. 

A. Theoretical bounds 

The complexity of LLL reduction of a Gaussian channel matrix $H$ has been studied in [9].
As we have seen in Section II, every instance of the LLL-ZF (respectively LLL-SIC) decoder
consists of three main phases:

1) A full Gram-Schmidt orthogonalization is performed at the beginning of the LLL algorithm.
This requires $O(nm^2)$ elementary operations.

2) The main "while" loop of the LLL algorithm requires $O(m^2)$ elementary operations for
each iteration. The number $K(H)$ of iterations of the LLL algorithm for a fixed realization
$H$ of the channel is bounded by

$$K(H) \le m^2 \log_{1/\sqrt{\delta}} \left( \frac{A(H)}{a(H)} \right) + m, \tag{11}$$

where $A(H)$ and $a(H)$ denote respectively the maximum and minimum norm of the Gram-Schmidt
vectors of $H$. For general $H$, $K(H)$ can be arbitrarily large. However, it was shown
in [9] that $E[K(H)] \sim O\left(m^2 \ln\left(\frac{m}{n-m+1}\right)\right)$.

3) Finally, the ZF and SIC receivers entail respectively the multiplication by the pseudo-inverse
of $H_{red}$ and its QR decomposition. Both have complexity order $O(nm^2)$.
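The bound (11) is easy to evaluate for a given channel realization. The sketch below reads the
Gram-Schmidt norms off the diagonal of the R factor of a QR decomposition and assumes that the
logarithm in (11) is taken in base $1/\sqrt{\delta}$; the function names are hypothetical.

```python
import numpy as np

def gs_norms(H):
    """Norms of the Gram-Schmidt vectors of the columns of H, i.e. |diag(R)| in H = QR."""
    R = np.linalg.qr(np.asarray(H, dtype=float), mode='r')
    return np.abs(np.diag(R))

def lll_iteration_bound(H, delta=0.75):
    """Evaluate m^2 * log_{1/sqrt(delta)}(A(H)/a(H)) + m, assuming this base for the log."""
    m = np.asarray(H).shape[1]
    norms = gs_norms(H)
    A, a = norms.max(), norms.min()
    return m ** 2 * np.log(A / a) / np.log(1.0 / np.sqrt(delta)) + m

rng = np.random.default_rng(0)
H = rng.standard_normal((12, 12))            # real model of a 6x6 complex channel
bound_random = lll_iteration_bound(H)
bound_orthogonal = lll_iteration_bound(np.eye(4))
```

For an orthogonal basis $A(H) = a(H)$ and the bound reduces to $m$, while an ill-conditioned
realization can make the ratio $A(H)/a(H)$, and hence the bound, arbitrarily large.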
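For fixed $H$, we can use the estimate (11) to obtain a bound on the number of iterations of the
LLL reduction of $\tilde{H}$. The Gram-Schmidt orthogonalization of $\tilde{H}$ yields

$$\tilde{H}^* = \begin{pmatrix} h_1^* & \cdots & h_m^* & 0_{n \times 1} \\ 0 & \cdots & 0 & t \end{pmatrix}.$$

In fact, the last Gram-Schmidt vector is the projection of $\begin{pmatrix} -y \\ t \end{pmatrix}$
on the subspace

$$\left(\mathrm{span}(h_1^*, \ldots, h_m^*)\right)^{\perp} \supseteq \left(\mathrm{span}(e_1, \ldots, e_n)\right)^{\perp} = \mathrm{span}(e_{n+1}).$$

Therefore

$$a(\tilde{H}) \ge \min(t, a(H)) = \min(\varepsilon\, a(H_{red}), a(H)).$$

LLL reduction increases the minimum of the Gram-Schmidt vectors [5], so $a(H_{red}) \ge a(H)$,
and $a(\tilde{H}) \ge \varepsilon\, a(H)$. On the other side
$t \le a(H_{red}) \le A(H_{red}) \le A(H)$ and so $A(\tilde{H}) = \max(t, A(H)) = A(H)$. Then

$$K(\tilde{H}) \le (m+1)^2 \log_{1/\sqrt{\delta}} \left( \frac{A(\tilde{H})}{a(\tilde{H})} \right) + m + 1
\le \frac{(m+1)^2}{c} \ln \left( \frac{A(H)}{\varepsilon\, a(H)} \right) + m + 1
= \frac{(m+1)^2}{c} \left( -\ln \varepsilon + \ln \frac{A(H)}{a(H)} \right) + m + 1,$$

where $c = \ln\left(1/\sqrt{\delta}\right)$. Following the same approach, we can estimate the average
$E[K(\tilde{H})]$, recalling that $\frac{A(H)}{a(H)} \le \kappa(H)$, the condition number of $H$,
and that [20]

$$E[\ln \kappa(H)] \le \ln \left( \frac{m}{n-m+1} \right) + 2.24.$$

We thus obtain

$$E[K(\tilde{H})] \le \frac{(m+1)^2}{c} \left( -\ln \varepsilon + E[\ln \kappa(H)] \right) + m + 1
\le \frac{(m+1)^2}{c} \left( -\ln \varepsilon + \ln \left( \frac{m}{n-m+1} \right) + 2.24 \right) + m + 1. \tag{12}$$

For the choice $\varepsilon = \frac{1}{2\sqrt{2}\,\alpha^m}$, we have $-\ln \varepsilon = O(m)$, so
the average number of iterations is $O(m^3)$ and the complexity of the main loop of the LLL
algorithm using the new method is at most of the order of $O(m^5)$.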
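To see how the bound (12) scales, the following sketch evaluates its right-hand side for square
systems ($n = m$) with $\varepsilon = 1/(2\sqrt{2}\alpha^m)$. The base of the logarithm
($c = \ln(1/\sqrt{\delta})$), the form of the condition number term and the function names are
assumptions made for this illustration.

```python
import math

def eps_max(m, delta=0.75):
    """Assumed choice of epsilon: 1 / (2 sqrt(2) alpha^m), with alpha = 1/(delta - 1/4)."""
    alpha = 1.0 / (delta - 0.25)
    return 1.0 / (2.0 * math.sqrt(2.0) * alpha ** m)

def avg_iteration_bound(n, m, eps, delta=0.75):
    """Right-hand side of (12), assuming c = ln(1/sqrt(delta))."""
    c = math.log(1.0 / math.sqrt(delta))
    log_kappa = math.log(m / (n - m + 1)) + 2.24
    return (m + 1) ** 2 / c * (-math.log(eps) + log_kappa) + m + 1

sizes = (4, 8, 16, 32)
bounds = [avg_iteration_bound(m, m, eps_max(m)) for m in sizes]
ratios = [b / m ** 3 for b, m in zip(bounds, sizes)]   # bound normalized by m^3
```

The normalized values in `ratios` decrease as $m$ grows, consistent with an iteration count of
order $m^3$; multiplying by the $O(m^2)$ cost per iteration gives the $O(m^5)$ order for the main
loop.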
B. Simulation results

Our complexity simulations show that the upper bounds (11) and (12) on the
average number of iterations of the LLL algorithm for LLL-aided linear decoding and the
augmented lattice reduction method are both quite pessimistic. The number of iterations for
both methods appears in fact to be almost linear in practice, see Figure 4.
We have chosen $\delta = \frac{3}{4}$ in all the numerical simulations.

While the number of iterations of LLL is indeed higher, approximately by a factor 2, for the
augmented lattice reduction (Figure 4), the total complexity expressed in flops^4 is about the same
for LLL-SIC and the augmented lattice method (see Figure 5). The additional complexity of the
LLL algorithm is balanced out by the complexity savings due to the fact that QR decomposition
is not needed.

C. Complex LLL reduction 

A generalization of the LLL algorithm to complex lattices has been studied in [16] and applied
to MIMO decoding. It has been shown experimentally that the complex versions of
LLL-ZF and LLL-SIC decoding have essentially the same performance as their real counterparts
but with substantially reduced complexity.

A complex version of the augmented lattice reduction can be implemented by LLL-reducing the
$(N+1) \times (M+1)$-dimensional matrix

$$\tilde{H}_c = \begin{pmatrix} H_c & -y_c \\ 0_{1 \times M} & t \end{pmatrix},$$

which saves about 40% of computational costs (see Figure 6) without any change in
performance.

VI. Conclusions 

In this paper, we introduced a new kind of lattice-reduction aided decoding which does not
require a linear or decision-feedback receiver at the last stage. We proved that this method attains
the maximum receive diversity order. Simulation results show that the new technique has a
substantial performance gain with respect to the classical LLL-ZF and LLL-SIC decoders, while
having approximately the same complexity order as LLL-SIC.

^4 Here we define a "flop" as any floating-point operation (addition, multiplication, division or square root).
MIMO 6x6, spatial multiplexing, 16-QAM

Figure 1. Performance comparison of augmented lattice reduction with LLL-ZF and LLL-SIC detection for a 6 x 6 uncoded
MIMO system using 16-QAM. The LLL algorithm is performed using $\delta = \frac{3}{4}$.



MIMO 6x6, spatial multiplexing, 16-QAM 




Figure 2. Performance comparison of augmented lattice reduction with LLL-ZF, LLL-SIC and Improved Lattice Reduction 
with MMSE-GDFE preprocessing for a 6 x 6 uncoded MIMO system using 16-QAM. 
Figure 3. Performance comparison of augmented lattice reduction with LLL-ZF and LLL-SIC detection with MMSE-GDFE
preprocessing for an 8 x 8 uncoded MIMO system using 16-QAM.




Figure 4. Average number of steps of the LLL algorithm as a function of the number n of transmit and receive antennas. 
Average number of flops, SNR=12 (curves: LLL-ZF, LLL-SIC, ALR, Sphere Decoder)

Figure 5. Complexity comparison (in flops) of augmented lattice reduction with LLL-ZF, LLL-SIC and sphere decoding as a
function of the number n of transmit and receive antennas, at SNR = 12, using 16-QAM constellations.
Figure 6. Complexity comparison (in flops) of the real and complex version of augmented lattice reduction as a function of the
number n of transmit and receive antennas, at SNR = 12. Here we suppose that complex addition and complex multiplication
require respectively 2 and 6 real flops.